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Method for screening the allelic state a-t tiie 5^-fl ankxng 
region of the aSl casein gene 



The invention refers to a genetic marker at the 5' -flanking 
region of the aSl casein gene {CSNISI) and the casein gene 
complex as well as a method to classify cattle, independent 
of age and lactation, through determination of the allelic 
state within this area as well as the application of this 
method to select organisms with a preferred allele, for 
instance in the marker-supported selection. 

Technical state of the art 

The hereditary potential of breed animals (regarding the 
milk protein content and other characteristics relevant to 
the breeding) is estimated at present through estimating the 
breeding value based on test matings and performance records 
of the descendants. The disadvantage of this conventional 
procedure is obvious. For cattle, it takes approx. 3 years 
from the first insemination by a test bull until the first 
daughters begin lactating, thus approx. 4 years until the 
registration of a complete lactation of the daughter. Only 
thereafter can the breed value be estimated. Until then, 
from both maintaining the bulls until the first estimated 
breeding values are available and the test mating, costs 
arise, which are substantial during this long period and due 
to the total amount of animals. This applies analogically to 
the registration of the own contribution and the 
determination of a breed value of cows. 

Therefore, for some years, with the help of the progress in 
the genome analysis, international efforts have been 
undertaken to develop genetic markers and direct gene tests 
to identify performance parameters relevant to breeding . 
Hereby, genome-wide analysis of markers have made possible, 
with the help of the linkage analysis, the approximation to 




the chromosome range, in which the gene locations are 
situated, which are relevant to the determination of 
performance, the so-called QTL regions (QTL = quantitative 
trait loci) . Such QTL studies and the resulting tests are 
5 described among others in the WO 20000 36143 and the 
WO 2001 57250 A2/A3. Further details of the QTL analysis of 
farm animals as well as a procedure based on QTL studies to 
also isolate causal candidate genes, are described in 
DE 100 17 675 Al . The disclosure of the invention is 

10 included therein. 

Regarding cattle and other species bred for milk production, 
the crucial criteria are the milk quantity, the protein 
content and fat. For these criteria, different QTL were 
identified, among other locations on the chromosome BTA 6. 

15 The potential QTL regions for protein contents are indicated 
relatively uniformly from different working groups within 
the area around or between the micro satellite markers BM143 
and TGLA37 and thus approximately 20-30 centimorgans (cM) 
away from the casein locus (Spelman et al. 1996, Genetics 

20 144, 1799-1808; Georges et al. 1995, Genetics 139, 907-920; 
Boldly et al, 1996, J Anim Breed Genet 133, 355-362; Zhang 
et al. 1998, Genetics 149, 1959-1973). According to 
Nadesalingam et al. (2001, Mammalian Genome 12, 27-31) the 
casein genes are, however, as well excluded as candidates 

25 for the observed QTL effects due to their position (40cM 
away from the QTL) . 

Since the mid 80' s, genetically conditioned milk protein 
varieties of cattle have been analysed with regard to an 
influence on milk yield and quality characteristics. This 

30 was realized partly by registering (in the milk) the 
phenotipically distinguishable protein variants (Ng-Kwai - 
Hang et al., 1984, J Dairy Sci 67, 835-840 and Ng-Kwai-Hang 
et al., 1986, J Dairy Sci 69, 22-26,), later by means of 
molecular genetic processes, which provided evidence of the 

35 genetic mutations upon which the protein variants are based 
(Sabour et al., 1996, J Dairy Sci 79, 1050-1056). Studies up 




to now partly reveal conflicting results of the examined 
variants which can not always be confirmed for different 
breeds and regional origins (summarised by Prinzenberg, 
1998, ISBN 3-922306-68-3, chapter 2.4, p. 14-21). The 
5 majority of these studies are concentrated on variants of 
the p-lactoglobulin, and the p- and K-casein, since in the 
asl casein, the frequency of only two protein variants is 
worth mentioning and in particular, in the already strongly 
selected milk breeds like Holstein Friesian / German 

10 Holstein, the protein variant asl casein B can be almost 
exclusively found (Ng-Kwai-Hang et al . , 1990, J Dairy Sci 
73, 3414-3420; Erhardt et al., 1993, J Animal Breed Genet 
36, 145-152; Lien et al., 1999, Animal Genetics 30, 85-91). 
In a more recent study of dairy cattle with different 

15 proportions of Holstein blood (Freyer et al., 1999, J Animal 
Breed Genet 116, 87-97), asl casein was also not used in the 
linkage analysis due to the lack of variability. 
Various tests are described concerning the molecular genetic 
differentiation of the asl casein variants B and C (David & 

20 Deutch 1992, Animal Genetics 23, 425-429; Schlee & Rottmann 
1992, J Anim Breed Genet 109, 316-319) . Individual gene test 
procedures for the rare alleles A, D and F also exist, 
(Prinzenberg 1998, ISBN 3-922306-68-3; chapter 4.1, p. 61- 
71) , as well as for the proof of a quantitative variant of 

25 the asl casein G (Mariani et al 1995, industria del Latte 
31, 3-13). By means of sequencing around 1,000 base pairs 
(bp) from the 5' -region of the asl casein gene from various 
cattle breeds, Schild & Geldermann (1996) showed 17 variable 
positions in the 5' -flanking region of the CSNISI gene, of 

30 which 5 have been detected due to different recognition 
sequences for the restriction endonucleases with Polymerase 
Chain Reaction-Restriction Fragment Length Polymorphism 
(PCR-RFLP) . According to Ehrmann et al. (1997, J Animal 
Breed Genet 114, 121-132), the 5' -flanking variants are each 

35 linked with certain protein alleles, in such a manner that 




the existence of given protein variants implies the 
existence of certain variants in the 5' -flanking region. 
Koczan et al. (1993, Animal Genetics 24, 74) also described 
a gene test to discriminate the asl casein B against C in 
5 German American Holstein, Black and White, and Jersey cows 
which is based on a fragment from the 5' -flanking region of 
the asl casein. For the last test mentioned, the strict 
linkage with the protein mutations asl casein B and C has 
however in the meantime been refuted and therewith, the 

10 validity for the following breeds: Aberdeen Angus, Anatolian 
black, Angeln, Asturian Valley, Ayrshire, British Frisian, 
Casta Navarra, Charolais, Chianina, Fighting Bull, Hereford, 
Jersey, Maremmana, Pezzata Rossa, Piedmontese, Scottish 
Highland, Turkish Grey Steppe (Jann et al., 2001; Arch. 

15 rierz, Dummerstorf 45, 13-21) . 

The casein genes are mapped as a closely linked gene locus 
in cattle and sheep at chromosome 6, in humans in chromosome 
4, and in mice at chromosome 5. The linking of casein genes 
has also been proven for other animal species (rabbit, pig, 

20 goat) . Due to this close linkage, the present designation of 
the site of the asl casein gene in the genetic map of cattle 
is linked to the site of the K-casein gene. In the up-to-date 
gene map for cattle, the physical position BTA6q31-33, and 
the genetic position 82.6 cM (MARC97) and 103.0 cM (IBRP97) 

25 resp., are stated for both genes. For this reason, the 
recombination rate between the asl casein and K-casein gene 
is assumed to be zero. 

The utilization of a lactalbumin sequence for the selection 
30 of breeding animals is revealed in EP0555435. Likewise, for 

the bovine K-casein, there exist numerous gene tests 

(Denicourt et al., 1990, Animal Genetics 21, 215-216; 

Medrano & Aguilar-Cordova, 1990, Biotechnology 8, 144-145; 

Pinder et al., 1991, Animal Genetics 22, 11-20; Schlee & 
35 Rottmann, 1992, J Animal Breed. Genet, 109, 153-155;. 
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Zadworny & Kuhnlein, 1990, Theor. Appl. Genet. 80, 631-634), 
since influences on the processing characteristics and 
cheese producing ability of the milk are attributed to this 
protein (see Lodes et al., 1996, Milchwissenschaft 51, 368- 
5 373 and 543-548) . 

Due to the mammary gland-specific expression, the promoters 
of bovine milk protein genes and also the asl casein gene 
are utilized in the creation of transgenic animals and for 
expression in cell cultures. DE 38 54 555 T2, the content of 

10 which is referred to here, describes the utilization of the 
asl casein promoter and signal peptide for the production of 
recombinant proteins in the milk of mammals. Rudolf also 
gives an overview of the use of transgenic animals for the 
production of recombinant proteins and the promoters used 

15 for this purpose. (1999, Trends in Biotechnology (TIBTECH) 
17, 367-374) . 

Winter et al. (2002, PNAS 99, 9300-9305) describe a direct 
gene test for a gene from the fatty acid metabolism {DGATl) 
and attribute an effect on the milk fat content to this 
20 gene. 

The disadvantage of all procedures which phenotypically 
differentiate (namely within milk samples) the genetic 
variants, based on a milk sample, lies in the fact that 

25 lactating cows can be studied exclusively. Thus, there 
exists the need to be able to examine the animal independent 
of lactation. Furthermore, the proven polymorphisms in the 
region of the milk protein gene to be coded do not represent 
a reliable marker for milk performance traits according to 

30 the current state of the art. The existing QTL analyses 
point to a QTL 1 which lies outside the milk protein genes. 
For asl casein, there is currently no marker available with 
sufficient variability, causing the fact that this gene's 
effects on lactating performance and content characteristics 

35 can almost not be studied. All available test procedures 

5 




depend on the molecular genetic differentiation of the also 
phenotypically available variation. 

The micro satellite markers, which was ascertained through 
QTL analyses, are only suited for conditional use in the 
5 marker protected selection, because the respective marker- 
QTL-linkage must first be explained. In each case, with 
these microsatellite markers it is a matter of indirect 
tests which, depending on the closeness of the linkage to 
the causal gene location, have less reliable results. 

10 The disadvantage of the procedures in EP 0555435 lies in the 
fact that a-lactalbumin only makes up a small portion (ca. 
2-5%) of the entire milk protein. At approx. 80%, the 
caseins (asl-, as2-, p- and K-casein) form the largest 
portion of the total protein. Thus, by applying these 

15 selection markers, only minor breeding progress is to be 
. expected. 

The gene test for DGATl from Winter et al. has the 
disadvantage that, from the perspective of the breeder and 
milk producer, the milk fat content is not of primary 
20 interest, but rather takes second place behind the protein 
content . 

The disadvantage of the procedures in DE 38 54 555 T2 lies 
in the fact that the utilized portion of the asl casein 
promoter is not more closely characterized by means of a 

25 nucleotide sequence. A 9kb fragment is utilized containing 
exons I and II, terminated with recognition sites for Kpnl 
and BamHl. No consideration of the exact base effect or 
possible variations takes place which can influence the 
effectiveness of the expression with this segment of the 

30 promoter. 

Currently, there are no reliable markers for milk protein 
content and no direct genetic test for a functional gene 
segment in order to test an animal's genetic potential, 
independent of age and lactation. 

35 
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Problems of 1:he Invezi'tlon 

Hence, it is the problem of the invention to make available 
a genetic marker and a procedure for the classification of 
milk production traits in order to examine an animal's milk 
5 production traits by means of their genetic material, 
independently of age and lactation. 

This problem is solved by making available, within the 
region of the asl casein gene, a marker which remains 
polymorphic and genetic also within selected milk breeds, 
10 and through a procedure which enables the classification of 
the animals independently of age and lactation, the genetic 
mapping of the asl casein gene, the examination of effects 
which are either closely linked with this gene location or 
thereby directly caused, as well as a breeding utilization. 

15 Based on the invention, the procedure refers to a genetic 
test for a functional gene segment, the reliability of the 
results is greater than with linkage markers and the test 
result is available within a few days to a few hours, 
whereby the substantial costs of test mating can be reduced. 

20 Thus, the procedure based on the invention eliminates the 
described disadvantages in the technical state of the art. 

With the marker based on the invention the selection of 
especially advantageous promoters for the production of 
expression vectors and transgenic animals is possible. 

25 In an especially advantageous practical embodiment, the 
invention consists of a test kit, which contains the 
oligonucleotides for the enrichment of a segment of the 
marker sequence of the asl casein gene, preferably the 
primer 1 CSNlSlprolf (5' GAA TGA ATG AAC TAG TTA CC 3'), 

30 primer 2 CSNlSlprolr (5' GAA GAA GCA GCA AGC TGG 3') and 
primer 3 CSNlSlpro2r (5' CCT TGA AAT ATT CTA CCA G 3' ) as 
well as reference probes for one or more sequences of the 
marker sequence of the asl casein gene and alleles thereof. 
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The following figures are enclosed with the description: 
Figure 1 DNA-sequence from the 5' -flanking region of the asl 
casein gene, in the following designated as marker sequence 
Figure 2 Alignment of the nucleic acid sequences of the 
5 allelic state of the asl casein gene allele 1^ allele 2, 
allele 3, allele 4 (differences in potential transcription 
factor-interfaces are highlighted) 

Figure 3 Schematic representation of the migration pattern 
of the alleles 1 to 4 of the marker CSNISI in the analysis 
10 SSCP. 

Figure 4 Result of the variance analysis 

It was surprisingly discovered that the examined sequence 
segment, which is flanked by the oligonucleotides 

15 CSNlSlprolf and CSNlSlprolr or CSNlSlpro2r (grey box in 
figure 1) within the breed German Holstein, contains four 
alleles which were detectable through a single-strand 
conformation polymorphism analysis and thus is sufficiently 
polymorphic in order to realize a genetic mapping and 

20 analysis concerning the effects of the alleles on the milk 
performance parameters . 

This concerns a fragment of 1061 bp within the 5' -flanking 
region and the exon 1 (refer to figure 1), in particular the 
fragment of 654 bp which is flanked by the two 

25 oligonucleotides CSnlSlprolf and CSNlSlprolr. 

The four alleles were cloned and sequenced. The sequence 
analysis was in accordance with the sequence published by 
Koczan et al. (1991, Nucleic Acids Research 19, 5591-56596; 
Genbank Acc. No. X59856) for allele 2, except for the length 

30 of poly-T (from position 390 of figure 1 onwards) . The 
alleles 1, 3 and 4 differ from this sequence by various 
substitutions and deletions. The variable positions are 
highlighted in the sequence alignment (figure 2) . In alleles 
1 and 4, potential transcription factor-binding sites are 

35 each affected by mutations. Thus, in allele 1, two potential 
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binding sites (for AP-1 and YYl) cease to exist, whereas in 
allele 4, a new potential ABFl-binding site emerges. 
The polymorphism found is therefore located in a supposedly 
functional gene region and thus, is a suitable marker for 
5 milk production traits, in particular for the protein 
content . 

Based on the current invention, the sequence fragment is 
flanked by the following oligonucleotide sequence, which is 
utilized as a primer for amplification by means of PGR, 
10 whereby the combinations Primer 1 with Primer 2, and Primer 
1 with Primer 3 are possible: 

Primer 1: CSNlSlprolf (5' GAA TGA ATG AAC TAG TTA CC 3') 
Primer 2: CSNlSlprolr (5' G7\A GAA GCA GCA AGC TGG 3') 
Primer 3: CSNlSlpro2r (5' CCT TGA AAT ATT CTA CCA G 3') 

15 

The primer binding sites are shaded grey in figure 1. 

A procedure, based on the invention, is made available, 
which can be carried out directly at the hereditary material 
20 of the organism to be examined. With the help of the marker 

based on the current invention, a genetic mapping of the asl 
casein gene within the linkage map is made possible and the 
determination of the allelic condition in individual 
organisms, e.g. cattle, is undertaken, which determines 
25 within a few hours the genetic potential with regard to milk 
protein content. 

The procedure for determining the genetic potential with 
regard to milk protein content by determining the allelic 
condition of the marker based on the current invention, in 
30 detail, consists of: 

1. Making available the genetic material of the organism to 
be examined, from male or female breeding cattle or an 
embryo thereof. 

9 




The organism is, by definition, an animal, particularly a 
mammal, in particular a bovine, a sheep or a goat, including 
embryos of these species. 

The organism is also a genetically modified organism (GMO) , 
5 which contains the described sequence fragment of the 5'- 
flanking region (figure 1) and of the asl casein gene or 
parts thereof . 

The genetic material is, by definition, genomic DNA or RNA 
from animals, but also plasmid DNA from bacteria, from 
10 artificial chromosomes such as BACs and YACs or 
constructions created from genetic material of various 
organisms for specific applications, e.g. for the production 
of transgenic organisms. 

The source material for the extraction of material 
15 containing DNA or RNA is namely blood, leukocytes, tissue 
including biopsy material, milk, sperm, hair, several cells 
including cell material from embryos, a bacteria culture or 
isolated chromosomes. Furthermore, genetic material already 
amplified beforehand, which contains the marker sequence 
20 (figure 1) or parts thereof, is again source material. 



2. Selective isolation or enrichment of the sequence 
fragments of figure 1, or a sequence which contains portions 
thereof, preferably the illustrated sequence fragment from 
25 position 1 to 654 of figure 1. 

The isolation of genetic material is achieved by standard 
methods as they are described in the handbook "Molecular 
Cloning" (Sambrook, Fritsch, Maniatis, 1989; Cold Spring 
Harbour Laboratory Press, New York) , or can be carried out 
30 by using commercially obtainable kits (e.g. Nucleospin, 
Machery Nagel, Diiren, Deutschland) . 

The enrichment is achieved preferably by means of polymerase 
chain-reaction (PCR, Mullis & Fallqona, 1987, Methods in 
Enzymology 155, 335-350), whereby f luorescently marked. 
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radioactively marked, or chemically marked primers can also 
be utilized. When using RNA as genetic material, a reverse 
transcription must be carried out beforehand (Myers & 
Gelfand 1991, Biochemistry 30, 7661-7666) . 

The sequence fragment is enhanced preferably by the 
following oligonucleotide sequences based on the current 
invention, which are utilized as a primer for the 
amplification by means of PGR, whereby the combinations 
primer 1 with primer 2, and primer 1 with primer 3 are 
possible 

The sequence fragment is enhanced preferably with the 
following oligonucleotide sequences, based on the current 
invention, as primer for the amplification, whereby the 
combinations Primer 1 with Primer 2 and Primer 1 with Primer 
3 are possible: 

Primer 1: CSNlSlprolf (5' GAA TGA ATG AAC TAG TTA CC 3') 

Primer 2: CSNlSlprolr (5^ GAA GAA GCA GCA AGC TGG 3') 

Primer 3: CSNlSlpro2r (5' CCT TGA AAT ATT CTA CCA G 3') 
The selection of further primers is explicitly possible, 
which makes possible the amplification of a partial sequence 
of the sequence described in figure 1, within which variable 
nucleotide positions are located in order to differentiate 
the alleles 1 to 4 , 

3. Proof of the allelic state in the isolated or enhanced 
sequence fragment of figure 1, preferably within the partial 
sequence, which is flanked by CSNlSlprolf and CSnlSlprolr. 

In order to determine the allelic state, various standard 
techniques are available which are well known by the expert: 
The sequencing according to Sanger et al. 1977, through an 
illustration of single-strand conformation polymorphisms 
(SSCP, Orita et al. 1989, Genomics 5, 874-879), restriction 
fragment length polymorphisms (RFLP; Botstein et al. 1980, 
American Journal of Human Genetics 32, 314-331) and PCR-RFLP 
(Damiani et al. 1990, Animal Genetics 21, 107-114; Medrano & 




Aguilar-Cordova 1990, Animal Biotechnology 1,13-11) r allele- 
specific PGR (= ARMS, ASPCR, PASA; Newton et al. 198 9, 
Nucleic Acids Research 11, 2503-2516; Sakar et al. 1990, 
Analytical Biochemistry 186, 64-68; David & Deutch 1992, 
5 Animal Genetics 23, 425-429) , oligonucleotide-ligation assay 
(= OLA; Beck et al. 2002, J Clinical Mikrobiol 40, 1413- 
1419) , temperature gradient gel electrophoresis (= TGGE, Tee 
et al. 1992, Animal Genetics 23, 431-435) and analogical 
procedures belonging to the technical state of the art. 

10 It is suggested to furnish the primers based on the current 
invention with a marker (fluorescent, radioactive or 
similar) and to determine the allelic state with a 
sequencing machine, autoradiography or chemiluminescence . If 
non-marked primers are utilized, the determination of the 

15 allelic state is carried out by illustrating the fragments 
according to gel electrophoresis through coloring of the 
nucleic acids, e.g. with ethidiumbromid (Sambrook et al., 
1989) or through the silver-coloring procedure (Bassam et al 
1991, Analytical Biochemistry 196, 80-83) . 

20 Furthermore, it is possible to utilize different high 
throughput methods for the mutation screening, including the 
utilization of oligonucleotide arrays (Dong et al 2001, 
Genome Research 11, 1418-1424), the TaqMan procedure (Ranade 
et al 2001, Genome research 11, 1262-1268), the fluorescence 

25 polarization method, (Chen et al 1999, Genome Research 9, 
492-498), mass spectrometric method (MALI-TOF; Sauer et al. 
2002, Nucleic Acids Research 30, e22). This enumeration is 
exemplary and is not to be understood as limited. 

The allelic state is hereby to be understood as the 
30 existence of a certain nucleotide sequence within the 
enriched fragment. Figure 2 exemplifies the nucleotide 
sequence of four different allelic states of the marker 
based on the current invention (figure 2, alleles 1, 2, 3 
and 4 ) . 
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In the case of sequencing, a comparison with the 
corresponding nucleotide sequences 1, 2, 3 and 4 in figure 2 
must be carried out in order to establish the analogical 
correlations between the allele types 1 to 4 . Based on the 
5 indicated nucleotide sequences, it is possible for a person 
familiar with the state of the art to determine the lengths 
of the fragments through a PCR-RFLP analysis or to conceive 
oligonucleotides for the detection through allele-specif ic 
PGR. Also, the adjustment of the other aforementioned 
10 techniques for a mutation screening can be done by the 
expert . 

Particularly preferable is the illustration of the allelic 
states by means of single-strand conformation polymorphisms 
(SSCP) , as the allelic state can be read directly from the 

15 fragment pattern. In addition to the detection of the 4 
alleles described here, the procedure also enables the 
recognition of further mutations which are not described 
here. For that reason it is also particularly well suited 
for the analysis of the homologous genome region of other 

20 animal species than cattle. In order to reduce the duration 
of the gel electrophoresis, it is recommended to utilize, 
instead of the entire sequence, a shorter fragment, e.g. the 
sequence marked with an arrow in figure 1 which is defined 
by the oligonucleotides based on the current invention. 

25 Figure 4 shows a schematic illustration of the alleles 1 to 
4 of the marker CSNISI in the SSCP analysis in a 12% 
acrylamide/bisacrylamide gel (49:1) with a 1% glycerol 
additive. The fields 1 to 4 represent the four different 
separation patterns of the alleles. The migration direction 

30 of the molecules in the electrical field from the cathode (- 
) to the anode ( + ) is represented by an arrow. The single 
strands of the alleles show a typical, clearly different 
separation pattern one from another. Since by means of 
silver coloring both DNA single strands are illustrated, 

35 each allele is characterized by two bands. 
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4. Selection of organisms which carry the respectively 
preferable allelic state of the marker based on the current 
invention. This can be e.g. the allelic state 1 or 4, which 
differs from allele 2 by the amount of potential binding 
5 sites for transcription factors. 
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Praci:ical embodimen-bs 

1. Procedure to classify the milk production traits through 
5 determination of the allelic state of the marker based on 
the current invention. 

Cattle blood is used as source material. The isolation of 
the genetic material (genomic DNA) is carried out according 
to the high-salt method of Montgomery & Sise (1990^ NZ J 
10 Agric Res 33, 437-441). 

In order to realize the amplification of the marker by means 
of PGR reaction the oligonucleotic sequences based on the 
current invention are utilized as primers: 

Primer 1 CSNlSlprolf (5' GAA TGA ATG AAC TAG TTA CC 3') 

15 Primer 2 CSNlSlprolr (5' GAA GAA GCA GCA AGC TGG 3') 

In 15]il, the reaction solutions respectively contain 20- 
lOOng genomic DNS to be tested, lOpmol of each 
oligonucleotide CSNlSlprolf and CSNlSlprolr, 0.5 U Tag DNA 
polymerase (Peqlab Biotechnologie, Erlangen) , 50 pM dNTPs in 

20 a standard buffer (lOmM Tris-HCl ph 8.8, 50 mM KCl, 1 . 5mM 
MgCl2) • The temperature program (of the thermo cycler, model 
iCycler of the company Biorad) is selected as follows: 1 
min. - 93*'C (Ix) , (40 sec - 91°C, 40 sec. 57*'C, 40 sec - 
70°C) (30x) and 3 min - 70°C (Ix) . Afterwards, cooling to 

25 4'*C takes place. 

Afterwards, to each reaction solution, 25|al of a formamide 
denaturation buffer (95% formamid, 0.025% (w/v) bromphenol 
blue, 0.025% (w/v) xylencyanol FF, 20 mM EDTA) are added 
respectively, the mixture is heated for 2 min at 93°C, 
30 cooled down in ice water and every 4pl of the mixture are 
loaded onto a 12% acrylamide/bisacrylamide gel (49:1) with a 
1% glycerol additive. The separation is carried out over 20h 
at 420V and 10°C in a vertical electrophoresis system, 
utilizing the square strip container Penguin P9DS (OWL 

15 




Scientific, Woburn, USA) with a gel of 0.8 mm thickness and 
a size of 16xl6cm. As running and gel buffer, 0 . 5x TBE has 
been utilized. After completion of the electrophoresis, the 
gels have been colored with silver nitrate according to the 
5 protocol of Bassam et al. (1990, Analytical Biochemistry 
196, 80-83) . The development reaction has been stopped by 
transferring the gels in an ice-cold 0.04M EDTA solution. 

The migration pattern of alleles 1 to 4 is shown 
schematically in figure 3. As by means of silver coloring, 
10 both (coding and non-coding DNA line) are colored, two 
fragments per allele are available respectively. 

2 . Illust:rat:lon of t:he var±al3lllt:y o£ t:lie marker CSNISI in 
various ca't'tle breeds 

15 From DNA of 83 cattle of the breeds German American Holstein 
(6 cattle), German Red cattle (4 cattle). Yellow cattle (7 
cattle), German Holstein (18 cattle). Black and White (9 
cattle), Jersey (13 cattle), Pinzgauer (20 cattle) and 
Simbrah (6 cattle) the nucleic acid sequence position 1 to 

20 655, as mentioned in figure 1, is amplified with the 
oligonucleotides, based on the current invention, 
CSNlSlprolf (5' GAA TGA ATG AAC TAG TTA CC 3') and 
CSNlSlprolr (5' GAA GAA GCA GCA AGC TGG 3') by means of PGR. 
The further procedure takes place as described in example 1. 

25 

In the breeds examined, the typical separation patterns, as 
revealed in figure 3, appear. 

3 . lllust:rat:±on of i:he varlabil±t:y of -the marker CSNISI 
30 within -the German Holst:eln breed 

From blood samples of 503 cows of the breed German Holstein, 
DNA is isolated according to the method of Mongomery & Sise 
(1990, NZ J Agric Res 33, 437-441) . The enrichment of the 
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sequence given in figure 2 is achieved with the primers 
based on the current invention, as described above. The 
illustration of the existing variations is achieved by means 
of SSCP-Technology . Among the cows examined of this breed, 
5 all four alleles are also detectable. The following allele 
frequencies were determined: 

Allele 1 - 0,031 Allele 2 - 0,739 

Allele 3 - 0,194 Allele 4 - 0,036 

10 Thus, allele 2 represents the most frequent allele in the 
German Holstein breed, followed by allele 3 and the two rare 
alleles 1 and 4 . 

The genotypes occurred in the frequency 22 > 23 > 24 > 12 > 
33 > 34. The genotypes 11 and 14 as well as the combination 
15 of these two rare alleles (genotype 14) were not found. 

4 . Gene^blc mapping of -bhe marker CSNISI 

By means of the procedure, based on the current invention, 
eight half -sib families of the breeds German Holstein (7) 

20 and Black and White (1) are genotyped with the marker CSNISI 
and subsequently compared with the results obtained by 
Thomsen et al. 2000 (J" Anim Breed Genet 117, 289-306), who 
already has genotyped these families for 10 further markers 
on BTA 6 (microsatellite marker) and has established a 

25 linkage map. The genotyping data for CSNISI are integrated 
into this existing data record. Mapping utilizing the BUILD 
function of the program package CRI-MAP (version 2.4, Green 
et al. 1990, Documentation of CRI-MAP, Washington School of 
Medicine, St. Louis, MO, USA) leads to two possible 

30 locations of the CSNISI marker: between the markers IL97 and 
FBN14 or FBN14 and CSN3. The additionally realized FLIPS 
analysis leads to a definitive mapping of CSNISI between the 
markers FBN14 and CSN3. The total length of the linkage map 
of BTA6, calculated with these 11 markers, is 161.1 cM. The 
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position of all markers included in the linkage map, and the 
corresponding indications from the existing linkage maps 
MARC97 and IBRP97, can be seen in the following table 1. 
Given are the markers to establish the linkage map of BTA6, 
5 the number of informative meioses as well as the positions 
(cM) on the genetic map calculated with CRI-MAP (the map 
based on the current invention is referred to as "ADR") in 
comparison to the two linkage maps MARC97 and IBRP97. For 
those markers indicated with "n.a.", no mapping is available 
10 in the respective linkage maps. 



Marker 


Informative 


Position (cM) 






ADR 


MARC 9 7 


IBRP97 


ILSTS93 


193 


0.0 


0.0 


16.0 


ILSTS90 


156 


28.5 


11.8 


0.0 


BM1329 


141 


56. 8 


35.5 


45.0 


URB16 


228 


57. 9 


n. a . 


40.0 


DIK82 


356 


78.5 


n. a . 


67.0 


ILSTS097 


78 


99. 6 


67.2 


89.0 


FBN14 


187 


104.1 


n. a . 


n.a. 


CSNISI 


280 


108. 1 


(like CSN3) 


(like CSN3) 


CSN3 


102 


113.5 


82. 6 


103. 0 


BP7 


208 


123. 6 


91.2 


n.a. 


BMC4203 


186 


161. 1 


112.9 


n.a. 



Table 1 



5. Variance analysis for es'blma'tlng t:he effects on tJie m±lk 
production i:ralt:s 

15 By means of the procedure based on the current invention a 
total amount of 729 bulls from 9 half-sib families of the 
breeds German Holstein and Simmental were genotyped with the 
marker CSNISI. The distribution of the genotypes within the 
9 half -sib families is shown in table 2. 

20 
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Faitii 1 y 


n 
ii 


CSNISI Genotype 






1 9 


1 


1 A 




O 


O A 
Z ft 






1 


7 Q 








Q 


X u 








2 


7 HP 

X L/ O 


A Q 
ft O 


c 
o 


D 


J / 


Q 


A 

ft 






^ 


J. u o 


4 




3 




1 n 


O / 




1 9 
X z. 


4 


27 


12 


3 


2 




10 








5 


12 




1 




5 


5 




1 




6 


27 








9 


16 


1 


1 




7 


55 


1 




1 


22 


4 


23 




4 


8 


56 


4 


2 




26 


17 


3 


1 


3 


9 


319 


10 






250 


50 


9 






total 


729 


79 


11 


11 


398 


131 


11 


3 


19 



Table 2 



The breeding values of the bulls are centrally estimated by 
5 the United Information Systems Animal Production (Vereinigte 
Inf ormationssysteme Tierhaltung - VIT) in Verden. A total 
amount of more than 150^000 daughters and their performance 
data are integrated in the estimation of the breeding 
values. From all bulls, deregressed breeding values, 

10 concerning the milk yield, the protein and fat yield, the 
protein content (in %) and the fat content (in %), are 
utilized in the variance component estimation. The 
deregression of the breeding values is carried out as 
described by Thomsen et al. (2001, J Anlm Breed Genet. 118, 

15 357-370) . 

The variance component estimation is carried out using the 
program package SAS. First, as unique fixed effect, the 
marker CSNISI is considered in the model, because other 
influence factors (e,g. operational effects, milking 
20 frequency) are already corrected in the frame of the 
estimation of the breeding value and the deregression 
(influence of the sires) . The analysis reveals significant 
effects of the marker CSNISI on all studied traits 




(deregressed breeding values for protein percentage 
(DRG_PP), milk yield (DRG_MY1), fat yield (DRG_FY1), protein 
yield (DRG_PY1) , fat percentage (DRG_FP) ) . Table 3 shows the 
effect of CSNISI on deregressed breeding values for milk 
5 production traits, indicating also the probability of error 
(p) for the effects on the individual traits. 



Trait 


Probability of error (p) 


DRG-PP 


< 0.0001 


DRG_MY1 


0.0011 


DRG_FY1 


0.0016 


DRG_PY1 


0.0056 


DRG_FP 


0.0052 



Table 3 



The highest significance is calculated for the effect on 
DRG_PP. As the examined marker CSNISI is located directly 
10 within the regulatory region of a milk protein gene, this 
could be an indication of a direct effect. The marker CSNISI 
fulfils the requirements to a functional candidate gene. 

The highest breeding value for milk (DRG_MY1) is achieved on 
average by bulls with the genotype 12, whereas the highest 

15 breeding values for protein percentage {DRG_PP) are found 
within the group with genotype 24. Table 4 shows a 
compilation of the least square means (LS_means) for the 
groups with the genotypes 12, 22, 23 and 24. The table 
displays the LS_means as well as standard errors for the 

20 deregressed breeding values for milk yield (DRG_MY1) and 
protein percentage (DRG_PP) in groups with different CSNISI 
genotypes . 
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CSNISI 
type 




LSMEAN ± se 




n 


DRG_MY1 


DRG_PP 


12 


79 


198.232 ± 15.700 


- 0.00022534 ± 0.00006470 


22 


398 


155.341 ± 6.995 


- 0.00037495 ± 0.00002921 


23 


131 


138.806 ± 12.192 


- 0.00038405 ± 0.00005271 


24 


76 


112.364 ± 16.007 


0.00008175 ± 0.00006650 


Alle 


684 


152.353 


-0. 000307 



Table 4 



In order to obtain a more exact clarification, the variance 
analysis is repeated within individual families and groups 
of families with identical genotypes. Hereby is revealed, 
5 that the effect on the milk yield can not be confirmed in 
all families. In family 9, in which the sires exclusively 
passed down the allele 2, the only remaining effect is 
encountered close to the 5% threshold of significance for 
DRG_PP (p = 0.0610). Furthermore, a comparison of the 

10 LS_means for the traits DRG_MY1, DRG_PP, DRG_FP is carried 
out for all groups of genotypes and within each individual 
family, and it is proved whether the difference of the LS- 
means between the genotypes 12, 23 and 24 and the most 
frequent genotype 22 is significant. The results are 

15 graphically illustrated in figure 5. 
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